Skip to main content
Scour
Browse
Getting Started
Login
Sign Up
You are offline. Trying to reconnect...
Close
You're currently offline. Some features may not work.
Close
Copied to clipboard
Close
Unable to share or copy to clipboard
Close
🎯 Reinforcement Learning
Q-learning, Policy Gradient, Reward Functions, TD Learning
Filter Results
Timeframe
Fresh
Past Hour
Today
This Week
This Month
Feeds to Scour
Subscribed
All
Scoured
82978
posts in
425.8
ms
Hybrid neural–cognitive models reveal how memory
shapes
human
reward
learning
nature.com
·
40m
🧠
Cognitive Science
On
Computation
and
Reinforcement
Learning
arxiv.org
·
1d
💬
Prompt Engineering
Distributional
Reinforcement Learning with Diffusion Bridge
Critics
arxiv.org
·
1d
💬
Prompt Engineering
Personalized Adaptive Feedback System for Early Detection and Intervention of Fine‑Motor Skill Development in
Preschool
Children Using Wearable
IMU
Sensors and Reinforcement Learning
freederia.com
·
1d
🗣️
LLMs
Deep reinforcement learning-based energy scheduling for green buildings with
stationary
and EV batteries of heterogeneous
characteristics
sciencedirect.com
·
14h
💬
Prompt Engineering
Part 5: Reward Engineering: How to Shape
Behaviors
in
Financial/Robotic
Tasks
dev.to
·
1d
·
Discuss:
DEV
💬
Prompt Engineering
i10e-lab/HelloRL
: A fully modular framework to make Reinforcement Learning quick and easy
github.com
·
13h
·
Discuss:
Hacker News
💬
Prompt Engineering
Continual
learning and the post
monolith
AI era
baseten.co
·
11h
·
Discuss:
Hacker News
💬
Prompt Engineering
Why
reinforcement
learning breaks at scale, and how a new method
fixes
it
techxplore.com
·
2d
💬
Prompt Engineering
Hybrid Model‑Based / Model‑Free Reinforcement Learning for Energy‑Efficient Autonomous Warehouse Robot Navigation with Real‑Time
Obstacle
Prediction **
Abstra
...
freederia.com
·
1d
💬
Prompt Engineering
Hypernetworks
: Neural Networks for
Hierarchical
Data
blog.sturdystatistics.com
·
1d
·
Discuss:
Hacker News
🧠
Machine Learning
Multi-Agent Reinforcement Learning (
MARL
): Practical Guide to
Cooperative
and Competitive Learning
dev.to
·
1d
·
Discuss:
DEV
💬
Prompt Engineering
Exploiting
large language model with reinforcement learning for generative job
recommendations
eurekalert.org
·
1d
🗣️
LLMs
Proposal: A Framework for
Discovering
Alien Physics via Optimal
Compression
lesswrong.com
·
15h
💬
Prompt Engineering
Finding all the roots of a
polynomial
using the
QR
algorithm
johndcook.com
·
10h
🤖
AI
Routing
in a
Sparse
Graph: a Distributed Q-Learning Approach
towardsdatascience.com
·
3d
💬
Prompt Engineering
Predicting
operators
reliability
for control room alarm management using knowledge-based Bayesian networks
sciencedirect.com
·
19h
🧠
Machine Learning
Rethinking
imitation
learning with Predictive
Inverse
Dynamics Models
microsoft.com
·
1d
🤖
AI
Exit
Strategy
joelchrono.xyz
·
6h
🌊
Stress Management
Your Agent Is
Slow
Because of
Inference
futureagi.com
·
18h
·
Discuss:
DEV
💬
Prompt Engineering
Loading...
Loading more...
Page 2 »
Keyboard Shortcuts
Navigation
Next / previous item
j
/
k
Open post
o
or
Enter
Preview post
v
Post Actions
Love post
a
Like post
l
Dislike post
d
Undo reaction
u
Recommendations
Add interest / feed
Enter
Not interested
x
Go to
Home
g
h
Interests
g
i
Feeds
g
f
Likes
g
l
History
g
y
Changelog
g
c
Settings
g
s
Browse
g
b
Search
/
Pagination
Next page
n
Previous page
p
General
Show this help
?
Submit feedback
!
Close modal / unfocus
Esc
Press
?
anytime to show this help